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JHj; Abstract 

£N) ■ Annihilation of cosmologically distributed dark matter is predicted to produce a potentially ob- 

> : 

ON , servable flux of high energy photons. This signal is predicted to be virtually uniform on the sky 

(N . 

but, in order to be identified, must be extracted from various Galactic and extragalactic back- 
ff) ■ grounds. We consider three techniques for extracting this signal from the backgrounds: spectral 

o : 

ON . discrimination, angular discrimination, and distribution discrimination. We analyze the first two of 

O ■ 

these with the Fisher Matrix formalism to obtain projections for constraints from the Fermi satel- 
lite. The third technique exploits the fact that the number of photons from extragalactic blazars is 
drawn from a distribution which is far from Poisson. Using a toy model, we show that knowledge 
of this distribution enhances one's ability to extract the dark matter signal, while ignorance of it 
can lead to the introduction of a large systematic error. 
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I. INTRODUCTION 



There is abundant evidence that non-baryonic dark matter is responsible for many gravi- 
tational effects observed over a wide range of scales jlj . Experimental efforts are now focused 
on identifying the particle nature of this substance. A particularly interesting possibility is 
that the dark matter may take the form of a weakly interacting massive particle (WIMP) 



which could be observed in underground direct detection experiments [2 



produced at accelerators such as the Large Hadron Collider jfl . A third class of experimen- 
tal approaches to this problem, known as indirect detection, consists of experiments which 
search for the products of dark matter annihilations, including neutrinos, cosmic rays, and 
gamma rays. 

A new and exciting range of possibilities for the indirect detection of dark matter has been 
opened with the launch of the satellite-based Fermi gamma ray space telescope (formerly 
known as GLAST) [?], H]. Fermi is sensitive to photons in the 100 MeV-300 GeV range, 
and benefits from far greater exposure and superior angular and energy resolution than 
its predecessor, EGRET. The flux of gamma rays produced in dark matter annihilations 
depends on both the WIMP's annihilation cross section, mass, and dominant annihilation 
modes, and on the spatial distribution of dark matter. An advantage of indirect detection 
relative to direct detection efforts is that the annihilation cross section probed is in many 
models directly related to that responsible for the primordial abundance of dark matter. 
Although there is variation from model-to-model, annihilation cross sections of order (erf) ~ 
3 x 10 -26 cm 3 sec -1 are common across a wide range of dark matter candidates. If the dark 
matter annihilation cross section is of this magnitude, Fermi and ground-based gamma ray 
telescopes will likely detect many photons from dark matter. The challenge lies in separating 
this signal from astrophysical backgrounds, which are likely to be tens to thousands of times 
as large, depending on the energy bin and direction on the sky. 

A general strategy for optimizing the chances of detecting dark matter is to combine 
angular and spectral features to disentangle the signal from backgrounds. The details of 
how this is best done, however, depend on the specific target one is focusing on. For 
example, in previous work 9|, three of us discussed techniques for separating dark matter 
annihilation products from astrophysical backgrounds in the Galactic Center region. The 
angular features of the signal from the smooth Galactic halo, or from unresolved sub-halos, 



and/or be 
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may also provide useful in 



multipole space 
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ormation for signal/background discrimination, either in real or 

,Q,Q. 



A different situation holds for the diffuse gamma ray flux resulting from the integrated 
sum of all extragalactic dark matter halos (the cosmological signal). To be identified, this 
signal will have to be separated from the extragalactic background due to unresolved gamma 
ray sources, such as blazars, as well as from residual contamination from the Galaxy. This 
procedure is delicate and, not surprisingly, the astrophysical interpretation of the results in 
the case of EGRET data has led to very different conclusions, see e.g. Q,Q,0,Q- Also, 



when removing the "Galactic background" one must account for the DM signal: Under some 



common assumptions (universality o 
dominate over the extragalactic one 



the DM profile in the halos) this signal is expected to 
JJ, H9(. Still, the cosmological DM signal is subject to 
very different systematics compared to the Galactic one and encodes a lot of information 
on the cosmological properties of DM, justifying a deeper study. Apart from the angular 



distribution of both signal and background 
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251 ] , there remain two potential 



differences which can be exploited to extract the signal: 

• The energy spectra of the signal and background are likely to be quite different. This 
difference has often been exploited to determine how well the signal can be extracted. 
In this paper, we use the Fisher Matrix formalism to simplify this task. 

• A common assumption underlying previous work has been that the number of photons 
from both signal and background in a given angular pixel are drawn from a Poisson 
distribution. In fact, as we illustrate in §11, this is not true in general. In particular, 
the blazar-produced photons are likely to be drawn from a probability distribution 
function (PDF) very different than Poisson. This opens the possibility of using the 
different underlying distributions to separate signal from background. Recently, a 
similar statistic has been studied for use in characterizing the signal of unresolved 



Galactic dark matter sub-halos 



26]. 



In this paper, we explore the efficiency of these techniques applied to pixel-statistics for 
extracting the gamma ray flux from cosmological dark matter annihilations. We derive a 
compact way to assess how effectively a given experiment can separate signal from back- 
ground using spectral information alone (§111) and then using both spectral and angular 
information ( §IVj) . In §V, we explore the information encoded in yet another potential dis- 



criminant: the probability distribution function (PDF) of counts. We make a simple attempt 
to understand the different distributions and find that there are both large advantages if 
one uses the correct distribution and considerable disadvantages if one assumes an incorrect 
distribution (§IV). A discussion and our conclusions are reported in §V. 



II. MODELS OF THE SIGNAL AND BACKGROUND 

Here, we describe simple models for the dark matter annihilation signal, for the back- 
ground from unresolved blazars, and the Galactic background. 



A. Cosmological Dark Matter Signal 

It has long been realized that, due to the dumpiness of virialized dark matter structures, 
the extragalactic dark matter annihilation signal is much larger than its naive expectation 
value from the average dark matter abundance in the universe j^] . The flux of gamma rays 
produced in dark matter annihilations throughout the cosmological volume is described by 

(av) c p\ t n 3 A 2 (*) 



= W_E_Px f dz{1 + z) 



h(z) 



x^(£ 7 (l + ^))e-^, (1) 

where (av) and mx are the annihilation cross section and mass of the WIMP. The spectrum 
of gamma rays per annihilation, dN 7 /dE y , further depends on the dominant annihilation 
channels. In this study, we consider the case of a 100 GeV WIMP which annihilates uniquely 
to W + W~ with cross section (av) = 3 x 10~ 26 cm 3 sec -1 , which in turn produce gamma 
rays through their decays. In Eq. ([Tj), p x denotes the average density of dark matter, A 2 (z) 
the average squared overdensity, r describes the estimated optical depth of the universe to 
gamma rays, Hq = 70 km/s/Mpc is the present value of the Hubble constant and h(z) = 



1 + z) 3 £l]\j + Q\ describes its evolution with redshift z in terms of the matter fraction, 
Om = 0.3, and cosmological constant, I^a = 1 — Qm (a flat universe is assumed). To calculate 



the flux of gamma rays from WIMP annihilations, we follow the procedure of Re 



assuming a universal halo profile either of the Navarro^ 
Moore et al. 



301 ] form. We adopt the Bullock et al. (31| convention for estimating halo 



renk and White (NFW) 



29| or 
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concentrations, which leads to enhancement factors of A 2 (0) = 1.15 x 10 5 and 1.18 x 10 6 for 
the two models, respectively. 

An important caveat is in order: Clearly, towards the Galactic Center this is not the 
dominant component of the diffuse dark matter signal, since the signal from the smooth halo 
of our Galaxy is larger. At high Galactic latitudes (which constitute the largest fraction of 
the solid angle), the signal which dominates depends on the degree of substructure surviving 



in the Milky Way llj. Calculations based on recent simulations [l3j suggest that the dark 
matter signal from galactic substructure dominate the (quasi-) isotropic background, at least 
for typical substructure distributions inferred from pure dark matter N-body simulations. 
Yet, quite a bit of uncertainty remains, especially since baryonic effects have not yet been 
included. Here, for simplicity, we consider only the extragalactic component, keeping in 
mind that for a given choice of the halo profile, this may underestimate the real contribution 
to the signal. 

B. Unresolved Blazars 

Over its mission, the EGRET experiment accumulated a catalog of 66 blazars (at high 



confidence) [32|, |33| . From the information contained in this catalog, it is possible to con- 
struct a model of the redshift distribution, luminosity function, and spectrum of these 
sources. In turn, such a model can be used to estimate the total flux of gamma rays expected 
to be produced by the large population of unresolved (typically fainter, or more distant) 
blazars. In this analysis, we adopt a blazar luminosity function based on the population 



study of Ref. [34J, and use a redshift distribution following the sub-mm/far-IR luminosity 



35| . We also adopt a universal spectral shape 



density associated with luminous IR galaxies 
of dN 1 /dE 1 oc E~ 2 - 2 . 

Although this model is broadly consistent with the properties of the blazars observed by 
EGRET, the limited sample size present in the EGRET catalog (and the limited amount of 
information available for each blazar) makes it difficult to construct such a model with much 
accuracy. This situation will be dramatically improved as Fermi begins to accumulate its own 
catalog of blazars. In particular, Fermi is expected to resolve ~ 10 3 blazars, providing a much 
larger sample with which to perform population studies. In fact, 104 blazars have alr eady 
been detected with very high confidence (> 10a) in the first 90 days of Fermi data |36| . 
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Furthermore, these observations will extend to much higher energies than those of EGRET, 
and will include blazars with lower luminosities and higher redshifts. These observations 
will enable the construction of a population model which will be capable of estimating the 
diffuse gamma ray spectrum from (unresolved) blazars with far greater accuracy than is 
currently possible. 

In Fig. [TJ we compare the diffuse gamma ray spectrum from unresolved blazars in our 
model with that from dark matter annihilations with the parameters assumed above. The 
flux from dark matter is shown for the case of both NFW and Moore et al. profiles. Note that 
only the normalization and not the spectral shape is affected by the choice of halo profile. 
Shallower dark matter halo profiles or a decrease in small-scale substructure would lower 
the signal, while any residual contribution from unresolved substructure at high galactic 
latitudes would boost it. A similar enhancement could result due to a larger cross section 
or additional small scale structures. 

Eq. (CQ) represents the average flux on the sky from cosmological dark matter annihilations. 
For any given experiment, this can be turned into the expected numbers of photons per pixel 
over a finite time. For example, imagine dividing half of the sky (the half least contaminated 
by the Galaxy) into iVp ix = 330,000 spatial pixels, each roughly (0.25 ) 2 , and counting the 
number of photons in each pixel accumulated over 5 years of observations with the Fermi 
satellite. Under the assumptions laid out above, Fermi would detect on average 0.06 photons 
per pixel (over 19,000 total photons over half of the sky) from cosmological dark matter 
annihilations, assuming an NFW profile. The mean count per pixel, in this case 0.06, does 
not tell the whole story, however. There is also the distribution from which photon counts 
in each pixel are drawn. Strictly speaking, neither the dark matter signal nor the blazar 
background are drawn from a truly Poisson distribution. Yet, the dark matter distribution 
is much more similar to Poisson, because there are many dark matter halos, most of which 
produce only one or no detectable photons over the duration of the experiment. Most halos 
generate zero photons, some produce one, few produce two, etc. 

The photon counts from blazars are drawn from a very different distribution, however, 
because only a small fraction of halos (those with aligned Active Galactic Nuclei) host 
blazars. Compared to dark matter halos, a larger fraction of these blazars are expected 
to produce many photons. Using information from the EGRET satellite, we can construct 
a model of blazar-produced photons and compare the distribution from which these are 
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FIG. 1: The cosmological diffuse s pec trum of gamma rays from dark matter annihilations and 



from unresolved blazars (from Ref. 3j] which may have suffered from incompleteness). We have 
considered a WIMP with a mass of 100 GeV, an annihilation cross section of (av) = 3 x 10 -26 
cm 3 sec^_, and which annihilates to W + W~ . Results are shown for two choices of the halo profile 
(NFW 



291 ] and Moore et al 



30]). For details regarding our blazar model, see the text. Also shown 
for comparison is the extragalactic diffuse flux observed by EGRET, as calculated in Ref. [1G[] , and 
an estimate of its fraction that will not be resolved by Fermi. 

drawn to a Poisson distribution. Note that here we are making two (probably unrealistic) 
approximations: (i) We are considering the case where the only background is due to blazars. 
While it is likely that emission from blazars makes up a large fraction of the isotropic flux, 
obviously this is a simplification, (ii) We are considering the dark matter signal as Poisson- 
distributed, which might be valid only for a fraction of the signal. Still, in order to illustrate 
the point, it is useful to work with these assumptions. In §V we shall come back discussing 
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qualitatively the impact of relaxing these approximations. 
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FIG. 2: The probability of observing iV 7 photons above 1 GeV in a (0.25°) 2 pixel in 5 years of Fermi 
observations. The Poisson distribution is normalized to give the same number of total photons. 
Note the large tail in blazar distribution compared with a Poisson distribution. 

In Fig. [2J we show the probability distribution for unresolved blazars in our model to pro- 
duce iV 7 detected photons in a given angular pixel of Fermi over 5 years. This is compared 
with a Poisson distribution which has the same number of expected photons, J2n~, A^ 7 P(A^ 7 ). 
The key point is that these two distributions are very different from one another; in par- 
ticular, the blazar distribution leads to many more pixels with many photons relative to 
the corresponding Poisson distribution. The total number of photons due to unresolved 
blazars in this model is 1.7 x 10 6 , nearly 100 times the number produced by dark matter 
annihilations using an NFW profile. 

In Fig. [3j we depict these distributions in two maps containing photons only from un- 
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resolved blazars. The photons in each pixel in the top map are drawn from the model 
distribution depicted in Fig. [2j There are many pixels with no photons (no blazars in that 
direction), but some pixels contain several hundred photons (pixels with more than 220 pho- 
tons are considered to be resolved and hence eliminated from the map). In contrast, in the 
bottom frame we show the map corresponding to photons drawn from a Poisson distribu- 
tion with the same number of photons per pixel as in the top map. The multiplicity in the 
Poisson distribution map is much more even: relatively few pixels with either no photons or 
with iV 7 > 10. This provides us with a new tool for discriminating the dark matter signal 
from background: the PDF of observed photons. 



C. Galactic Background 

Even far from the Galactic plane, the Galactic background is considerably larger than the 
dark matter signal so must be included to obtain realistic projections. A simple fit, proposed 
in [?| and calibrated on EGRET data, for the intensity of photons from the Galaxy as a 



37] 



function of energy and Galactic coordinates is 

I gf *(E,l,b) = N (l,b)I (E) (2) 

where 

I (E) = KT 6 (^L) ' cm-W^GeV- 1 , (3) 



and 



85.5 



Nq(1 b) = { V /l +( / /35)V 1 +[ fe /(l-l+0.022|/|)]2 ^ 

85 - 5 , + 0.5 1/1 < 30° 

v/l+(//35)Vl+(Vl-8) 2 ' ' - 

and both / and b are in degrees. 

This model predicts that Fermi will detect 6.1 x 10 7 photons above 1 GeV from the Galaxy 
over the course of five years of observations. We consider this model as an upper limit to the 
truly diffuse Galactic emission. In §sec:ani, we include this Galactic contribution and use 
both angular and spectral information to see how well the cosmological dark matter signal 
can be extracted. We leave the spatial template and the spectral index fixed, and use only the 
normalization as a free parameter. This has a physical motivation: the spatial template — 
while realistically different from the above toy-model — will be obtained by high-statistics 
sub-GeV observations. Since its shape depends on the product of density of interstellar 
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FIG. 3: Top: Map of counts from unresolved blazars using blazar model described in the text. 
Bottom: Map of the same number of total couiifis drawn from a Poisson distribution. 



material times cosmic-ray density along the line of sight, one does not expect it to change 
with energy. Also, the spectral index 2.7 is more or less what is observed in cosmic ray 
protons of 10-10000 GeV energy (which generate the photons in the energy range of interest), 
and photons produced by 7r via spallation follow the same power-law as the primaries. 

In the next section, we explore the power of spectral discrimination, then add in angular 
discrimination, and finally turn to discrimination via distributions in a simple 2-component 
model. 

III. SPECTRAL DISCRIMINATION 

One way to extract the dark matter annihilation signal from astrophysical backgrounds 
is to exploit differences in the spectrum of each component. We first focus on the simple ex- 
ample where the shapes of the spectra are known and we fit the data for the two amplitudes. 
Generalizing to the more realistic case of unknown shape parameters is straightforward, and 
we illustrate this at the end of this section by allowing the slope of the blazar spectrum 
and the mass of the dark matter particle to vary. In this section, we neglect all angular 
information and treat both signal and background as isotropic on the sky. We break the 
gamma ray sky up into N e different energy bins (we will use N e = 25 bins logarithmically 
spaced in energy between 1 GeV and 300 GeV). For now, we assume that the likelihood of 
observing (N±, N 2 , . . . iVjvJ photons in each of the energy bins is Gaussian: 



where N s is the total number of expected counts due to the (dark matter) signal in all bins 
and /? the corresponding spectral shape normalized so that J2i ft = ^-) an d N b and j\ are 
the analogous quantities for the background. The noise in the i th bin is <Xj. To project the 
errors on the two free parameters in this model (N s and N b ), we compute the curvature of 
the likelihood function, or the 2x2 Fisher matrix, 






^ftfr 



(6) 
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where /x, v run over signal and background. Consider the case where the noise is Poisson 
noise so that of = N h j\ + N s ff. Then the Fischer matrix simplifies to 

rp _ Ji Ji /<7\ 

" u h Nb f! + Ns fi' 

The F ss component of this matrix is the inverse of the square of the l-a projected error on 
the number of signal events assuming the number of background events, N b , is known in 
advance. This is called the unmarginalized error on N s : 



{AN.) 



unmarg 



iV e r« r« 



-fs -fs 

ST~^ Ji Ji /o\ 

[ti Nh fi+ Ns fi\ ' 



More relevant is the error when jV& is allowed to vary freely. In that case, the marginalized 
error on N s is [(F~ 1 ) ss \ 1 ^ 2 . Explicitly, 

(AiV s ) marg = (A ^ } — rs , (9) 



r 



where r measures the extent to which the two spectra are orthogonal to one another: 



(10) 



V F ss F bb 

If the two spectra are very different, then r is close to zero, and it is easy to extract 
the signal from the background. Quantitatively, in that limit, (AiV s ) mar g = (AiV s ) unmarg . 
Notice from Eq. (|8l) that this error scales as y/N' as naively expected (e.g., significance as 
defined in Ref. [38]), with the shape functions providing the precise numerical coefficient. 
If the spectra are similar, though, the marginalized error can become arbitrarily large as r 
approaches one. Eq. ([9]) offers a compact way to assess how effectively a given experiment 
can separate signal from background using spectral information alone. 

In the idealized case in which the spectral shape and normalization of the diffuse back- 
ground from unresolved blazars are known in advance (from a detailed population study of 
resolved blazars, for example), we find that this technique can be used to determine the num- 
ber of signal events from five years of observation by Fermi to an accuracy of AiV s = 1270. 
This is only 2% tighter than the Poisson error AN S = ^/N, = 1289. So if the background 
photons counts were known exactly, spectral information would add little discriminatory 
power. In the absence of such information, however, we are forced to marginalize over the 
normalization of the background. In that case, Eq. (Q projects that the error goes up to 
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(AiYs) marg = 6277. A simple way to interpolate between these two extremes - marginal- 
ized and unmarginalized errors - is to introduce a prior on the background number counts. 
This corresponds to multiplying the likelihood in Eq. §5$) by exp —(Nj, — Nb) 2 /^ <J N b , or 
equivalently by adding 1 / a 2 Nb to the bb component of the Fisher matrix. 
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FIG. 4: The projected 1-sigma error on the number of events from dark matter annihilations as 
a function of how well known the background is for 5 years of Fermi observations. A Gaussian 
prior is placed on the number of background events with variance a\ . The topmost line depicts 
the result if no spectral information is used; the middle line if spectral information from 25 bins 
is used; and the bottom horizontal line simply extends the "fixed-background" (corresponding to 
o 7v b = result). Poisson noise - the square root of the number of events - is depicted by the 
vertical arrow. 

Fig. H] depicts the errors on N s as a function of the width of the prior, <j^ b (ie. the 
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uncertainty on the background flux). If a^ b is very small, much smaller than N b , then 
the unmarginalized error is obtained. As the prior gets looser (larger cr/vj, however, the 
projected error on N s gets larger. The middle (dashed) curve in Fig. H] illustrates the 
transition from the unmarginalized error to the marginalized result, about 4 times larger. 
The upper curve illustrates that, with no spectral discrimination, the error on N s scales 
simply as ajy b . The reality check here is that N b = 1.7 x 10 6 , so a Nb ~ 1000 - roughly the 
transition region - corresponds to knowing background counts to better than 0.1%, clearly 
impossible. We thus conclude that, even with a very detailed blazar model derived from 
future population studies, we will not be able to predict the background flux with sufficient 
precision to make use of the unmarginalized error as described Eq. ([8]). In all practical cases, 
analysts will need to marginalize over the background flux. 

It is straightforward to vary other parameters, such as the spectral index of the blazar 
spectrum (while still assuming a power law spectrum) and the mass of the dark matter 
particle. The key ingredients in computing the Fisher matrix are the derivatives of the 
number of events with respect to, now, the four parameters, taken to be ln(iV s ), ln(iV b ), 
ln(mDM), and a, the slope of the background spectrum. These derivatives are depicted in 



Marginalizing over the three other parameters (N b , a, jit-dm) leads to a 1-sigma error 
AN S = 8846 (as opposed to 6277 found when the spectral index is fixed to -2.2 and the 
mass to 100 GeV). Considering that an NFW profile and a cross section of av = 3 x 10~ 26 
cm 3 sec -1 leads to 19,400 signal events, the 2-sigma upper limit after 5 years would be 
~ 2.7 x 10 _26 cm 3 sec -1 , consistent with the results of Ref. 01. 



IV. ANGULAR DISCRIMINATION 

Photons originating from cosmic rays incident on our Galaxy are likely to be far more 
numerous than those coming from outside the Galaxy. Indeed, in the model described in 
$U\ Fermi will detect 6.1 x 10 7 Galactic photons over the course of 5 years over the whole 
sky. This is almost 20 times larger than the number of photons produced by unresolved 



Fig. El 




14 




1 10 100 

E (GeV) 



FIG. 5: The derivative of the total number of events in each of 25 energy bins with respect to 4 
parameters: ln(./V s ), where N s is the number of photons from dark matter an- nihilations; ln(iV 6 ), 
with N b the number of events from un- resolved blazars; a, the slope of the blazar spectrum; 
and ttt-dM) the dark matter mass. These derivatives are evaluated around the fiducial values 
(N s ,N b , a, todm) = (1-9 x 10 4 , 1.7 x 10 6 , -2.2, 100 GeV). 

blazars and over a thousand times more than the extragalactic dark matter signal 1 . Spectral 
discrimination alone will clearly not be sufficient to eliminate this background. Here we 
include the different angular distributions of the Galactic and extragalactic components to 
project limits on the number of dark matter-produced events. 

To include both angular and spectral information, we generalize the argument of the 

1 Recall that the numbers quoted in ^IIII - 1.7 x 10 6 and 19,000 - were for only half the sky. In this section 
we double these since we use the full sky. 
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exponential in Eq. (JSJ) to 



% AW (AT. a - N s f? - N»f» - ^J gal (^, l a , b a ) 

x =EE- -2 -• (11) 

i=l o=l U ia 

Here, in addition to the sum over energy bins, we sum over iVpi x angular pixels, each labeled 
with (l a ,b a ). The model of §TT] is multiplied by a normalization factor n 9 , equal to one in 
the model but allowed to float in our fit. The likelihood function (or % 2 ) therefore now 
depends on five parameters: two characterizing the dark matter signal (amplitude N s and 
mass widm); two characterizing extragalactic backgrounds (amplitude N b and slope a); and 
one for the normalization of the Galactic background n 9 . 

To project constraints on these parameters, we compute the (now 5-dimensional) Fisher 
matrix: 

1 d 2 x 2 

F ^ = 2dp^ (12) 
where p M are the five parameters. For example, with p 5 = n 9 , taking the derivatives leads to 

^55 -EE s 1 a) ) ■ (i3) 

i=l a =l \ ° ia ) 



The 1-sigma limit on the number of signal events, AiV s = y(F _1 )n is now equal to 34,000, 
very close to the full sky NFW signal of 39,000. The 2-sigma upper limit on the annihilation 
cross section becomes 5.3 x 10~ 26 cm 3 sec -1 , so the Galactic photons pollute even regions 
far from the Galactic plane, thereby degrading the upper limit by a factor of 2. 

The full Fisher matrix contains interesting information about the shape of the likelihood 
function in the full five dimensional parameter space. One way to explore this structure is 
to generalize Eq. ( TlQl) and consider the 5x5 dimensional correlation matrix with elements 

• F ^ (14) 



F F 



This is depicted in Fig. [61 Note the strong correlation between the amplitudes of the isotropic 
components N s and N b and the strong anti-correlation between ttt-dm and a expected from 
the similarity in the derivatives in Fig. [5j 



V. DISTRIBUTION DISCRIMINATION 



As the distribution of photons from dark matter annihilations is expected to be close to 
Poisson, and the background from blazars is not, the natural question to ask is whether the 
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FIG. 6: The projected correlation matrix for a set of parameters used to fit 5 years of Fermi data. 
Note the strong correlation between N s , the dark matter produced photons, and the unresolved 
blazar background amplitude N b . Similarly, the Galactic background is correlated with N s : n$ = 
0.65. Thus the Galactic photons degrade Fermi's sensitivity to this dark matter signal. 

signal can be extracted from such backgrounds by exploiting this distinction. A complete 
answer to this question requires an understanding of the PDF's of all backgrounds and signals 
anf folding in constraints from spectral and angular information such as those developed 
above. Here we take a first step in this direction by considering a toy model with just two 
components: extragalactic dark matter and unresolved blazars. Further we assume that the 
PDF of dark matter-produced photons is Poisson. As a preliminary illustration, note that 
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with an average of 0.06 photons from dark matter annihilations in each (0.25 ) 2 angular 
pixel, fewer than 0.5% of all pixels will contain more than one photon from dark matter. 
In contrast, 86% (71%) of all photons from blazars will fall in pixels with 10 (20) or more 
photons. Thus, by simply throwing away the photons in angular pixels with many photons, 
one can potentially remove the majority of the background from blazars, while retaining 
nearly all of the signal from dark matter. 

Quantitatively, the probability of observing {Ni, N 2 , . . .} photons in a set of iVpi x pixels 
is given by 



where Pb is the probability distribution for blazar photons, P s is the probability distribution 
for dark matter photons, and N s is the total number of signal photons expected (which scales 
with (<?v)). N s is the only free parameter in the model. P s depends on the mean number of 
expected events in the pixel, equal to N s /N P i x . Here we do not use spectral information, so 
Ni simply denotes the total number of photons detected in spatial pixel 1. The information 
contained in this distribution could be combined with spectral (and angular) information in 
a full likelihood analysis. 

The standard assumption is to take both Pf, and P s to be Gaussian 2 , so maximizing the 
likelihood reduces to minimizing the x 2 '- 



where N b is the total number of background photons and the denominator assumes that 
only Poisson noise is relevant. For the sake of this exercise, let us assume that N b is known. 
Under this assumption 3 , minimizing the \ 2 leads to AN S = \J N b . 

But what if the background counts were not drawn from a Gaussian distribution, but 
rather from the distribution shown in Fig. [2]? How would this affect the results? Would an 
analyst who knew (or could estimate) the true distribution be able to exploit this information 
to extract the signal more effectively? Conversely, would an analyst ignorant of the true 

2 This is virtually equivalent to taking the distributions to each be Poisson. 

3 When the uncertainty in N b is included, AN S will go up as we saw in §1111 The goal here though is 
to understand how much discrimination power lies in the different distributions, and we need a baseline 
prediction against which to judge the power, so we settle for fixed iV&. 



P [{N U N 2 , . . .}\N S ] = J] T, P b( N i -j)P s (j\N s /N pix ), 



(15) 



i=l j=0 




i=l 



(16) 
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distribution who assumed a Gaussian distribution be led to false conclusions? To answer 
these questions, we generated counts in iVp ix = 330, 000 pixels (roughly (0.25 ) 2 each over half 
the sky) from the "true" distributions (Poisson for photons from dark matter, and that shown 
in Fig.[2]for photons from blazars) and then analyzed these counts in two different ways in an 
attempt to extract the one free parameter, N s . Then we repeated this exercise multiple times 
to accumulate statistics on how accurate each analysis technique was. The first technique 
analyzed the simulated data using the correct probability distributions in Eq. ffl5|) . while 
the second assumed (incorrectly) that the backgrounds were also drawn from a Poisson 
distribution. In each case, we tabulated the likelihood function C(N S ) = P [{Ni, N 2 , . . -}|^V" S ] 
as a function of N s and computed the central 68% confidence region. As expected, both 
analysis techniques retrieved the correct value of N s on average. The correct technique 
reported a \-o error on N s of 331; the Gaussian technique reported a 1-cx error of 1291. 
This is to be compared with the Poisson (unmarginalized) error of AN S = 1289. We thus 
conclude that using the correct distribution leads to an improvement in sensitivity by a 
factor ~4! 

The corollary of the notion that knowing the underlying distributions is useful for ex- 
traction is the danger that not knowing the distributions will lead to errors. In fact, this 
happens when the incorrect distribution is assumed. Consider the results of the 10 runs 
depicted in Fig. [7J Each red box represents one Monte Carlo run analyzed with the two dif- 
ferent likelihoods. The position of the box and the associated error bar along the horizontal 
axis denotes the estimate of N s and its 1-a error using the correct likelihood of Eq. ( fl5j) . 
The position of a box along the vertical axis, in contrast, denotes the estimate obtained 
using the (incorrect) Gaussian likelihood, similar to Eq. (fTBT) . Note that the spread in the 
measurements using the correct estimator is comparable to the error bars. However, the 
spread in extracted values using the incorrect distribution is larger than the reported error 
bar by approximately an order of magnitude. This is a particularly pernicious systematic 
error: if analysts unknowingly use the incorrect underlying distributions, the resulting esti- 
mates for AN S will be much smaller than the true uncertainty. This result argues that, in 
order to optimally extract the dark matter signal, we need to understand the PDFs of both 
background and signal. 
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FIG. 7: Constraints on the number of events from dark matter annihilation from ten different sim- 
ulations. Values along the x-axis were analyzed using the correct likelihood function in Eq. (|15p . 
from which the simulations were drawn. Values along the y-axis were obtained by assuming (in- 
correctly) that the background events were drawn from a Gaussian distribution. Note the different 
scales along each axis. The black point is the true value and the error bars in each direction on 
that point represent Poisson errors in the background counts. Note that estimating N s using the 
correct distribution leads to error bars smaller than Poisson and estimating it using the incorrect 
distribution leads to a large spread in the results. 

VI. DISCUSSION AND CONCLUSIONS 

In this article, we have studied the possibility of separating the cosmological gamma ray 
background produced in dark matter annihilations from the flux from unresolved blazars by 
using spectral information, angular information, and the differing probability distribution 
functions (PDFs). Using only spectral information, the resulting error on the amplitude 
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of the dark matter signal, given in Eq. (Q, is a simple function of the spectra and the 
energy bins in the experiment. Angular information can/should also be incorporated to 
separate out the Galactic background. The probability distribution of pixel-counts from 
which the background and signal are drawn is also a potential discriminator. In particular, 
we have shown that the dark matter signal can be extracted from a much larger background 
making use of these distributions. In fact, the extraction was even more effective than that 
obtained using spectral information, at least in the case considered here, providing a tool 
complementary to multipole analyses proposed in the recent past. The dangerous corollary 
of this result is that using an incorrect PDF can lead to a systematic error in the signal 
extraction, potentially much larger than the corresponding statistical error. 

The analysis presented here has assumed two important approximations: (i) An isotropic 
background resulting solely from unresolved blazars; and (ii) Photons from cosmological 
dark matter annihilations drawn from a Poisson distribution. It is currently believed that, 
at least well above one GeV, blazars are likely to be the main contributors to the unre- 



solved gamma ray background (for a critical discussion of this point, see 4CJ, |4l|, |42j) . Other 



backgrounds are also expected to be present including, for example, the "guaranteed" con- 



tribution from ordinary galaxies 



43( or the flux from byproducts of ultra-high energy cosmic 



ray interactions [44( (for a review, see 42j). Depending on energy, these sources are expected 
to contribute from ~ 0.1% to ~ 10% of the EGRET background, and have a distribution 
closer to that from dark matter than from blazars. While the Galaxy contribution has a 
spectral shape quite different from the expected dark matter signal, the background from 
extragalactic cosmic ray interactions would be quite degenerate with it, making the method 
presented here unlikely to be successful in identifying the dark matter component if it is 
below a few percent of the EGRET diffuse flux. One might turn the argument around and 
conclude that, even in absence of a dark matter signal, the method presented here might 
be useful in studying sub-dominant, quasi-isotropic components of the diffuse signal. The 
second approximation mentioned above should prove easier to address. We can study the 



PDF of the dark matter signal as was done for Galactic sub-halos in Ref. 26] to enhance 
the separation power. Furthermore, as population studies from Fermi become available, a 
more realistic model of unresolved blazars (as well as other potential gamma-ray sources) 
can be constructed. 

As a final remark, let us stress that these considerations could significantly improve the 
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bounds on decaying dark matter candidates as well. For a given particle physics scenario, 
the assumption of Poisson-distributed cosmological emission should be an even better ap- 
proximation; furthermore, the signal does not suffer from uncertainties of halo profiles and 
sub-structures. Further, in this case, the isotropic component is even more important for 
detection, since for decaying dark matter one does not expect a much larger signal from the 
Galactic Center region. 

This work has been supported by the US Department of Energy, including grant DE- 
FG02-95ER40896 and by NASA grant NAG5- 10842. We thank Gianfranco Bertone, Savvas 
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